Audio responsive augmented reality

ABSTRACT

Systems and methods for receiving audio data; identifying one or more graphical interface elements that correspond to the audio data; generating a display of the identified one or more graphical interface elements, wherein a first portion of the one or more graphical interface elements is persistently displayed, and wherein a second portion of the one or more graphical interface elements is temporarily displayed for a predetermined period of time together with the first portion of the one or more graphical interface elements; and at expiry of the predetermined period of time, ceasing display of the second portion while maintaining display of the first portion.

CLAIM OF PRIORITY

This application is a continuation of and claims the benefit of priority of U.S. patent application Ser. No. 16/528,273, filed on Jul. 31, 2019, which is a continuation of and claims the benefit of priority of U.S. patent application Ser. No. 15/666,349, filed on Aug. 1, 2017, which claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 62/369,643, filed on Aug. 1, 2016, each of which are hereby incorporated by reference herein in their entireties.

TECHNICAL FIELD

Embodiments of the present disclosure relate generally to automated processing of video and audio streams. More particularly, but not by way of limitation, the present disclosure addresses systems and methods for generating persistent graphical representations within a video stream and responsive to audio within the video stream.

BACKGROUND

Telecommunications applications and devices can provide communication between multiple users using a variety of media, such as text, images, sound recordings, and/or video recording. For example, video conferencing allows two or more individuals to communicate with each other using a combination of software applications, telecommunications devices, and a telecommunications network. Telecommunications devices may also record video streams to transmit as messages across a telecommunications network.

Although telecommunications applications and devices exist to provide two-way video communication between two devices, there can be issues with video streaming, such as modifying images within the video stream during the pendency of a communication session. Telecommunications devices use physical manipulation of the device in order to perform operations. For example, devices are typically operated by changing an orientation of the device or manipulating an input device, such as a touchscreen. Accordingly, there is still a need in the art to improve video communications between devices and to modify video streams in real time while the video stream is being captured.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

Various ones of the appended drawings merely illustrate example embodiments of the present disclosure and should not be considered as limiting its scope.

FIG. 1 is a block diagram illustrating a networked system, according to some example embodiments.

FIG. 2 is a diagram illustrating an augmented reality system, according to some example embodiments.

FIG. 3 is a flow diagram illustrating an example method for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments.

FIG. 4 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 4A is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 5 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 5A is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 6 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 6A is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 7 is a flow diagram illustrating an example method for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments.

FIG. 8 is a flow diagram illustrating an example method for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments.

FIG. 9 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 10 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 11 is a flow diagram illustrating an example method for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments.

FIG. 12 is a user interface diagram depicting an example mobile device and mobile operating system interface, according to some example embodiments.

FIG. 13 is a block diagram illustrating an example of a software architecture that may be installed on a machine, according to some example embodiments.

FIG. 14 is a block diagram presenting a diagrammatic representation of a machine in the form of a computer system within which a set of instructions may be executed for causing the machine to perform any of the methodologies discussed herein, according to an example embodiment.

FIG. 15 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

FIG. 16 is a user interface diagram depicting the augmented reality system in operation, according to some example embodiments.

The headings provided herein are merely for convenience and do not necessarily affect the scope or meaning of the terms used.

DETAILED DESCRIPTION

The description that follows includes systems, methods, techniques, instruction sequences, and computing machine program products illustrative of embodiments of the disclosure. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide an understanding of various embodiments of the inventive subject matter. It will be evident, however, to those skilled in the art, that embodiments of the inventive subject matter may be practiced without these specific details. In general, well-known instruction instances, protocols, structures, and techniques are not necessarily shown in detail.

An augmented reality system is described that immerses the user in a responsive real time environment which automatically modifies visualizations, graphical representations, and other aspects of the augmented environment based on changing conditions in the real world as received through video and audio signals. In various example embodiments, the augmented reality system generates a responsive real time graphic equalizer which adapts to changing audio input of the real world environment. In some embodiments, the experience of the augmented reality system may be optimized for display during evening hours. In some instances, aspects of the video stream may be modified to change the real world environment to optimize the augmented reality environment for display of certain graphical representations.

The real time graphical equalizer may be generated as a virtual auroral display. The virtual auroral display imitates natural auroral displays such as the aurora borealis. The real time graphical equalizer may also be generated as an inflatable light sculpture, particles, shapes, colors, filters, objects, or other graphical representations. The graphical representations, generated as a part of the real time graphical equalizer, may be rendered as three-dimensional elements. For example, particles (e.g., bubbles, stars, twinkling lights, lines of auroral light) may be generated as a three-dimensional particle having modulating characteristics based on the audio/visual input of the real-world environment. In some instances, the modulating characteristics include colors, shapes, color intensities, opacity values, sizes, velocities, and other suitable characteristics.

The augmented reality system may provide users with an augmented reality received through a combination of input devices of a client device, such as a camera and a microphone. In some embodiments, responsive to audio data received through the microphone of the client device, the augmented reality system may automatically modify the modulating characteristics of graphical representations, particles, auroras, or other elements generated and rendered within the augmented real time environment. Where the graphical representations are auroras, the augmented reality system may generate glowing, moving lines of light that resemble the aurora borealis which fade into view above a person depicted within a video stream. Graphical elements may also be generated as particle animations appearing and bouncing around the person in time to a sound within the audio data.

In some example embodiments, at least a portion of the augmented reality system may be implemented on a client device of a user. The user opens an application, via an input device, stored on the client device. Once the application opens, initiating the augmented reality system, the user moves the client device to generate an orientation change. For example, the user may move the client device upward towards the sky. The user may then perform an extended user input (e.g., press and hold a touchscreen). Based on the change in orientation of the client device and upon the user input exceeding a time threshold, the augmented reality system may initiate a visualization of a real time augmented environment. The augmented reality system may identify data from the video stream (e.g., audio or video data) and generate graphical or visual elements. The augmented reality system may then render the graphical or visual elements within a video stream received through the camera of the client device and present the graphical or visual elements on a display device of the client device in the real time augmented environment.

For example, in some embodiments, the user opens the application on a smartphone and moves the smartphone skyward. Opening the application may initiate a video stream from a camera of the smartphone and display the video stream in real time on a display device of the smartphone. The user then presses and holds a touchscreen on the smartphone to initiate the augmented reality system. The augmented reality system may unlock a dynamic light visualization (e.g., an aurora) in a portion of the video stream such that the dynamic light visualization is displayed over at least a portion of the sky depicted within the video stream. The augmented reality system may analyze audio signals or data received through a microphone of the smartphone and cause changes to the dynamic light visualization (e.g., movement in the aurora; new particles; changes in size, shape, and speed of the particles) and display the changes in the dynamic light visualization in real time. When the user moves the smartphone to capture additional elements in real world surroundings, the augmented reality system may maintain display (e.g., including periodic or real time changes) of at least a portion of the dynamic light visualization where the sky remains visible in one or more frames of the video stream. In some example embodiments, when the user moves the smartphone in a manner that temporarily removes the sky from frames of the video stream, once a portion of the sky reenters the video stream, the augmented reality system resumes display and modification of the dynamic light visualization in the portion of the video stream depicting the sky.
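
By way of a non-limiting illustration, the trigger logic described above can be sketched as follows in Python. The threshold values and the representation of the orientation change as a pitch angle are assumptions introduced solely for this example and are not required by the embodiments described herein.

    # Illustrative sketch only: decide whether to start the augmented reality
    # visualization based on (1) an orientation change of the client device and
    # (2) an extended touch input exceeding a time threshold.

    PITCH_THRESHOLD_DEGREES = 45.0   # hypothetical "device pointed skyward" threshold
    HOLD_THRESHOLD_SECONDS = 1.0     # hypothetical press-and-hold duration threshold

    def should_start_visualization(pitch_degrees: float,
                                   touch_down_time: float,
                                   current_time: float) -> bool:
        """Return True when the device is oriented upward and the touch input
        has been held longer than the configured time threshold."""
        orientation_ok = pitch_degrees >= PITCH_THRESHOLD_DEGREES
        held_long_enough = (current_time - touch_down_time) >= HOLD_THRESHOLD_SECONDS
        return orientation_ok and held_long_enough

    # Example: device tilted 60 degrees upward, touch held for 1.2 seconds.
    print(should_start_visualization(60.0, touch_down_time=10.0, current_time=11.2))  # True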

The above are some specific example embodiments. The various embodiments of the present disclosure relate to devices and instructions by one or more processors of a device to modify an image or a video stream transmitted by the device to another device while the video stream is being captured (e.g., modifying a video stream in real time). An augmented reality system is described that identifies and tracks objects and areas of interest within an image or across a video stream and through a set of images comprising the video stream. In various example embodiments, the augmented reality system generates and modifies visual elements within the video stream based on data captured from the real-world environment within the video stream and an accompanying audio stream.

FIG. 1 is a network diagram depicting a network system 100 having a client-server architecture configured for exchanging data over a network, according to one embodiment. For example, the network system 100 may be a messaging system where clients communicate and exchange data within the network system 100. The data may pertain to various functions (e.g., sending and receiving text and media communication, determining geolocation, etc.) and aspects (e.g., transferring communications data, receiving and transmitting indications of communication sessions, etc.) associated with the network system 100 and its users. Although illustrated herein as client-server architecture, other embodiments may include other network architectures, such as peer-to-peer or distributed network environments.

As shown in FIG. 1, the network system 100 includes a social messaging system 130. The social messaging system 130 is generally based on a three-tiered architecture, consisting of an interface layer 124, an application logic layer 126, and a data layer 128. As is understood by skilled artisans in the relevant computer and Internet-related arts, each component or engine shown in FIG. 1 represents a set of executable software instructions and the corresponding hardware (e.g., memory and processor) for executing the instructions, forming a hardware-implemented component or engine and acting, at the time of the execution of instructions, as a special purpose machine configured to carry out a particular set of functions. To avoid obscuring the inventive subject matter with unnecessary detail, various functional components and engines that are not germane to conveying an understanding of the inventive subject matter have been omitted from FIG. 1. Of course, additional functional components and engines may be used with a social messaging system, such as that illustrated in FIG. 1, to facilitate additional functionality that is not specifically described herein. Furthermore, the various functional components and engines depicted in FIG. 1 may reside on a single server computer or client device, or may be distributed across several server computers or client devices in various arrangements. Moreover, although the social messaging system 130 is depicted in FIG. 1 as a three-tiered architecture, the inventive subject matter is by no means limited to such an architecture.

As shown in FIG. 1, the interface layer 124 consists of interface components (e.g., a web server) 140, which receives requests from various client-computing devices and servers, such as client devices 110 executing client application(s) 112, and third party servers 120 executing third party application(s) 122. In response to received requests, the interface component 140 communicates appropriate responses to requesting devices via a network 104. For example, the interface components 140 can receive requests such as Hypertext Transfer Protocol (HTTP) requests, or other web-based, Application Programming Interface (API) requests.

The client devices 110 can execute conventional web browser applications or applications (also referred to as “apps”) that have been developed for a specific platform to include any of a wide variety of mobile computing devices and mobile-specific operating systems (e.g., IOS™, ANDROID™, WINDOWS® PHONE). Further, in some example embodiments, the client devices 110 form all or part of an augmented reality system 160 such that components of the augmented reality system 160 configure the client device 110 to perform a specific set of functions with respect to operations of the augmented reality system 160.

In an example, the client devices 110 are executing the client application(s) 112. The client application(s) 112 can provide functionality to present information to a user 106 and communicate via the network 104 to exchange information with the social messaging system 130. Further, in some examples, the client devices 110 execute functionality of the augmented reality system 160 to segment images of video streams during capture of the video streams and transmit the video streams (e.g., with image data modified based on the segmented images of the video stream).

Each of the client devices 110 can comprise a computing device that includes at least a display and communication capabilities with the network 104 to access the social messaging system 130, other client devices, and third party servers 120. The client devices 110 comprise, but are not limited to, remote devices, work stations, computers, general purpose computers, Internet appliances, hand-held devices, wireless devices, portable devices, wearable computers, cellular or mobile phones, personal digital assistants (PDAs), smart phones, tablets, ultrabooks, netbooks, laptops, desktops, multi-processor systems, microprocessor-based or programmable consumer electronics, game consoles, set-top boxes, network PCs, mini-computers, and the like. User 106 can be a person, a machine, or other means of interacting with the client devices 110. In some embodiments, the user 106 interacts with the social messaging system 130 via the client devices 110. The user 106 may not be part of the networked environment, but may be associated with the client devices 110.

As shown in FIG. 1, the data layer 128 has database servers 132 that facilitate access to information storage repositories or databases 134. The databases 134 are storage devices that store data such as member profile data, social graph data (e.g., relationships between members of the social messaging system 130), image modification preference data, accessibility data, and other user data.

An individual can register with the social messaging system 130 to become a member of the social messaging system 130. Once registered, a member can form social network relationships (e.g., friends, followers, or contacts) on the social messaging system 130 and interact with a broad range of applications provided by the social messaging system 130.

The application logic layer 126 includes various application logic components 150, which, in conjunction with the interface components 140, generate various user interfaces with data retrieved from various data sources or data services in the data layer 128. Individual application logic components 150 may be used to implement the functionality associated with various applications, services, and features of the social messaging system 130. For instance, a social messaging application can be implemented with one or more of the application logic components 150. The social messaging application provides a messaging mechanism for users of the client devices 110 to send and receive messages that include text and media content such as pictures and video. The client devices 110 may access and view the messages from the social messaging application for a specified period of time (e.g., limited or unlimited). In an example, a particular message is accessible to a message recipient for a predefined duration (e.g., specified by a message sender) that begins when the particular message is first accessed. After the predefined duration elapses, the message is deleted and is no longer accessible to the message recipient. Of course, other applications and services may be separately embodied in their own application logic components 150.
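
As a rough, non-limiting illustration of the message-expiry behavior described above, the following Python sketch models a viewing window that begins at first access; the class name, field names, and in-memory representation are hypothetical and are not part of the social messaging system 130.

    import time

    class EphemeralMessage:
        """Hypothetical in-memory record of an ephemeral message; the viewing
        window starts when the recipient first accesses the message."""

        def __init__(self, content: str, duration_seconds: float):
            self.content = content
            self.duration_seconds = duration_seconds  # predefined duration, e.g., set by the sender
            self.first_accessed = None                # set on first access

        def access(self, now: float):
            """Return the content if still available; None once the duration has elapsed."""
            if self.first_accessed is None:
                self.first_accessed = now             # first access starts the viewing window
            if now - self.first_accessed <= self.duration_seconds:
                return self.content
            return None                               # treated as deleted / no longer accessible

    msg = EphemeralMessage("hello", duration_seconds=10.0)
    t0 = time.time()
    print(msg.access(t0))         # "hello": first access starts the window
    print(msg.access(t0 + 11.0))  # None: the predefined duration has elapsed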

As illustrated in FIG. 1, the social messaging system 130 may include at least a portion of the augmented reality system 160 capable of identifying, tracking, and modifying video data during capture of the video data by the client device 110. Similarly, the client device 110 includes a portion of the augmented reality system 160, as described above. In other examples, client device 110 may include the entirety of augmented reality system 160. In instances where the client device 110 includes a portion of (or all of) the augmented reality system 160, the client device 110 can work alone or in cooperation with the social messaging system 130 to provide the functionality of the augmented reality system 160 described herein.

In some embodiments, the social messaging system 130 may be an ephemeral message system that enables ephemeral communications where content (e.g., video clips or images) is deleted following a deletion trigger event such as a viewing time or viewing completion. In such embodiments, a device uses the various components described herein within the context of any of generating, sending, receiving, or displaying aspects of an ephemeral message. For example, a device implementing the augmented reality system 160 may identify, track, and modify an object of interest, such as pixels representing skin on a face depicted in the video clip. The device may modify the object of interest during capture of the video clip without image processing after capture of the video clip as a part of a generation of content for an ephemeral message.

In FIG. 2, in various embodiments, the augmented reality system 160 can be implemented as a standalone system or implemented in conjunction with the client device 110, and is not necessarily included in the social messaging system 130. The augmented reality system 160 is shown to include an access component 210, a coordinate component 220, an analysis component 230, a generation component 240, an orientation component 250, a modification component 260, a tracking component 270, and a presentation component 280. All, or some, of the components 210-280 communicate with each other, for example, via a network coupling, shared memory, and the like. Each component of components 210-280 can be implemented as a single component, combined into other components, or further subdivided into multiple components. Other components not pertinent to example embodiments can also be included, but are not shown.

FIG. 3 depicts a flow diagram illustrating an example method 300 for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments. The augmented reality elements may be generated and rendered or otherwise depicted in a video stream while the video stream is being captured by the client device 110. The operations of method 300 may be performed by components of the augmented reality system 160, and are so described below for purposes of illustration.

In operation 310, the access component 210 receives or otherwise accesses a video stream captured by one or more input devices of the client device 110. The video stream may comprise one or more frames containing image data and audio data. The one or more input devices may include a camera, image capture device, video capture device, a microphone, and any other suitable input devices capable of capturing the video stream and audio data.

In some embodiments, the augmented reality system 160 may be preloaded or pre-cached on the client device 110 and may be triggered by a combination of user input and location. For example, the display aspects of the augmented reality system 160 may be limited to a predetermined location, geographical region, street address, set of coordinates, or distances extending from one or more of the former. By way of further example, the augmented reality system 160 may be georeferenced and time referenced such that a period (e.g., one day) ahead of a scheduled time, at least a portion of the augmented reality system 160 may be pre-cached on the client device 110. Client devices 110 which receive the pre-caching may be a set distance from a location (e.g., fifty-six miles from the Santa Monica Pier). The augmented reality system 160 may be activated within an ephemeral messaging system proximate to the scheduled time. In some instances, the augmented reality system 160 may be activated for users who engage in one or more other aspects of the ephemeral messaging system during a predetermined previous time period.

In operation 320, the coordinate component 220 identifies a set of coordinates within a portion of the one or more frames of the video stream. Upon capturing the video stream, the augmented reality system 160 may identify the set of coordinates within a portion of the one or more frames of the video stream. The set of coordinates may be determined using external information or landmarks within the image, may be generated and overlaid without regard to aspects of the depicted image, or may be determined in any other suitable manner.

In operation 330, the analysis component 230 identifies one or more audio characteristics within the audio data of the video stream. The analysis component 230 may identify the audio characteristics after identifying the set of coordinates. The one or more audio characteristics may include one or more components of the audio data which may be analyzed by the analysis component 230. In some embodiments, the one or more audio characteristics comprise one or more frequencies, a volume, one or more time signatures, one or more tempos, one or more beats, one or more timbres, one or more musical instruments, one or more voices, one or more keys, one or more pitches, or any other aspect of audio data which may be analyzed to differentiate between two or more audio streams, types of audio (e.g., ambient noise and music), or any other sound received from a sound capturing input device.
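
One possible, non-limiting way to derive such audio characteristics from a buffer of audio samples is sketched below in Python using NumPy. The particular features computed (a root-mean-square volume and a dominant frequency) are illustrative choices only; any of the characteristics listed above could be extracted instead.

    import numpy as np

    def audio_characteristics(samples: np.ndarray, sample_rate: int) -> dict:
        """Compute illustrative audio characteristics (volume and dominant
        frequency) from a mono buffer of floating-point samples."""
        rms_volume = float(np.sqrt(np.mean(samples ** 2)))        # rough loudness measure
        spectrum = np.abs(np.fft.rfft(samples))                   # magnitude spectrum
        freqs = np.fft.rfftfreq(len(samples), d=1.0 / sample_rate)
        dominant_frequency = float(freqs[int(np.argmax(spectrum))])
        return {"volume": rms_volume, "dominant_frequency": dominant_frequency}

    # Example: a 440 Hz tone sampled at 44.1 kHz for 0.1 seconds.
    rate = 44100
    t = np.arange(int(0.1 * rate)) / rate
    tone = 0.5 * np.sin(2 * np.pi * 440.0 * t)
    print(audio_characteristics(tone, rate))   # dominant_frequency is approximately 440 Hz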

In operation 340, the generation component 240 generates one or more graphical interface elements (e.g., augmented reality elements) based on the one or more audio characteristics within the audio data. In some embodiments, the generation component 240 generates the graphical interface elements in response to the analysis component 230 identifying the one or more audio characteristics. The one or more graphical interface elements may be generated in a first portion of the video stream (e.g., a portion of the frames of the video stream). In some embodiments, the first portion of the video stream in which the graphical elements are generated and presented follows the portion of the video stream in which one or more of the coordinates are identified, the audio characteristics are identified, and the graphical elements are generated.
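
Continuing the sketch above, and purely as a non-limiting example, the audio characteristics might be mapped to parameters of the generated graphical interface elements as follows; the particular mapping (louder audio produces more and larger particles, higher dominant frequencies shift the hue) is an assumption made for illustration.

    import random

    def generate_particles(characteristics: dict, count_scale: float = 20.0) -> list:
        """Illustrative mapping from audio characteristics to particle parameters.
        The mapping itself is an arbitrary example, not required behavior."""
        volume = characteristics["volume"]                 # roughly 0.0 to 1.0
        frequency = characteristics["dominant_frequency"]  # Hz
        particles = []
        for _ in range(int(volume * count_scale) + 1):
            particles.append({
                "x": random.random(),                      # normalized screen position
                "y": random.random(),
                "size": 0.01 + 0.05 * volume,              # larger when louder
                "hue": min(frequency / 2000.0, 1.0),       # higher pitch shifts the hue
                "opacity": 0.3 + 0.7 * volume,
            })
        return particles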

For example, as shown in FIG. 4, the one or more graphical elements may be positioned within the video stream, in real time, as the video stream is being captured. In FIG. 4, the graphical elements 400 may include a first set of graphical elements 410 depicted in a first region 412 and a second set of graphical elements 420. The first set of graphical elements 410 may be or combine to form an aurora within the video stream positioned within a predefined but variable area of one or more frames of the video stream. The aurora may be defined within the first region 412 by a density, intensity, or color range of the first set of graphical elements 410. As shown in color in FIG. 4A, the first set of graphical elements 410 is depicted as an aqua or light blue hue, an aurora, in the first region 412. As shown in FIG. 4, the first region is generally defined by a dotted line as a representation of a color barrier. As shown in FIG. 4A, the first region 412 is shown as a blue or aqua colored light with a first intensity and opacity. A second region 414 is shown in FIG. 4 as defined by a second dotted region. In FIG. 4A, the second region 414 includes blue light having a differing hue and opacity than the first region 412. Although FIG. 4A shows the first set of graphical elements 410 as aqua and the first region 412 as the aqua color in a first intensity and hue and the second region 414 with a second intensity and hue of blue, the graphical elements may be any suitable color, such that aqua is a representational color which may be changed, modified, or replaced with another color, such as a green, pink, yellow, or any other suitable color range. The second set of graphical elements 420 may include a set of individual particles of any suitable shape. As shown in FIG. 4, the second set of graphical elements 420 includes a set of circles or dots suspended in the field of view of the image capture device. The second set of graphical elements 420 may be variable in position, size, and movement (e.g., velocity and direction of movement). For example, as shown in FIG. 4A, the second set of graphical elements 420 may include a plurality of dots, each dot having a different color, size, diffusion around edges, and position. As shown in FIG. 4A, a first graphical element 422, of the second set of graphical elements 420, is depicted as a yellow dot, having a high opacity and intensity. A second graphical element 424 is shown with a second color (e.g., rose), a second opacity (e.g., more transparent or translucent than the first graphical element 422), and a second edge diffusion. For example, as shown, the second graphical element 424 has more diffuse edges than shown for the first graphical element 422.

In some embodiments, the graphical elements may be persistent based on the audio characteristics. The graphical elements, in these instances, may persist within the video stream while an audio characteristic associated with the graphical element is being received by the audio capture component. While the graphical elements are present within the video stream, components of the augmented reality system 160 may modify the graphical elements according to behaviors associated with the graphical elements based on changes in the audio data received by the audio capture device, as described below in more detail. For example, as shown in FIGS. 4-6, the first set of graphical elements 410 (e.g., the aurora) is persistent, being presented in multiple frames of the video stream. As shown in FIGS. 4A, 5A, and 6A, the aurora of the first set of graphical elements 410 may change shape, color, and size based on changes in the audio stream. In some embodiments, the persistence of the graphical elements may define a presentation of a form of the graphical element across multiple frames of the video stream, while the persistent graphical element is variable in its presentation across the frames. For example, as shown in FIGS. 4-6, the first set of graphical elements 410 is persistent, presented in both frames of the video stream, and is variable in color, shape, opacity, and position within the frames. As shown within FIGS. 4-6, the first set of graphical elements may change position (e.g., concentrating in a top right corner, occupying a top third of the frame, and occupying a top fourth of the frame), color (e.g., all blue or aqua, to partially aqua and partially dark blue, to partially aqua and partially green), opacity, and shape. Further, as shown in FIGS. 4, 5, and 6, regions of the first set of graphical elements 410 may differ over time. As shown in FIG. 4, the first region 412 and the second region 414 are shown at first positions. In FIG. 5, changes in the first set of graphical elements 410 cause a modification of the first region 412 and the second region 414 such that the first set of graphical elements 410 may be divided into a plurality of regions 512, 514, and 516. As shown in FIG. 5A, the color intensity defining the plurality of regions provides bright intensity aqua colors defining a third region 512 and a fourth region 514, of the plurality of regions. A fifth region 516 is shown having a different color hue, a darker blue than the aqua of the third region 512 and the fourth region 514. As shown in FIGS. 6 and 6A, the plurality of regions 512, 514, and 516 may further change position and composition. As shown in FIG. 6, a subsequent set of regions 612, 614, and 616 are shown. In FIG. 6A, the subsequent set of regions are defined by color variations in the first set of graphical elements 410. As shown, a sixth region 612 includes a portion of the first set of graphical elements 410 providing a yellow to green spectrum of color at a first intensity. A seventh region 614 includes a portion of the first set of graphical elements 410, having a different color than those in the sixth region 612. The portion of the first set of graphical elements 410 in the seventh region 614 provides a plurality of blues, from aqua to dark blue, at a first intensity range. The portion of the first set of graphical elements 410 in the eighth region 616 has a more uniform hue (e.g., aqua) than the portion of the first set of graphical elements 410 in the seventh region 614, but has a higher and more uniform intensity, such that a boundary between the seventh region 614 and the eighth region 616 may be viewed as one or more of an intensity threshold and a hue threshold.

In some embodiments, the graphical elements may be temporary or ephemeral based on the audio characteristics. The graphical elements may be generated for a predetermined period of time and cease presentation within the video stream after the period of time has elapsed. The temporary graphical elements may be initially generated, disappear, and be regenerated based on the same audio characteristic being detected and remaining present within the video stream and audio data. In some embodiments, a portion of the one or more graphical elements may be persistent while a portion of the graphical elements may be temporary. As shown in FIGS. 4-6, the second set of graphical elements 420, depicted as variable particles, may be temporary elements. For example, the second set of graphical elements 420 depicted in FIGS. 4 and 4A may change over time to a second set of graphical elements 520, shown in FIGS. 5 and 5A. As shown, a portion of the second set of graphical elements 420 have changed in size, shape, opacity, position, or color to be included in the second set of graphical elements 520. In FIG. 4A, the second set of graphical elements 420 are shown including the first graphical element 422, a small yellow dot, the second graphical element 424, a larger diffuse red or rose colored dot, and third graphical elements 426 formed by a plurality of varying sized white dots having a relatively uniform hue, intensity, and edge diffusion. Modification of the graphical elements transforms the second set of graphical elements 420 into the second set of graphical elements 520, shown in FIGS. 5 and 5A. As shown, a portion of the second set of graphical elements 420 have terminated and been replaced by elements of the second set of graphical elements 520. As shown in FIGS. 4, 4A, 5, and 5A, the second set of graphical elements 520 include graphical elements 522, 524, and 526, among others. The graphical element 522, a diffuse magenta colored dot; the graphical element 524, a diffuse golden dot; and the graphical element 526, a diffuse violet dot, may be included in the new portion of a second set of graphical elements 520. These graphical elements 522, 524, and 526 represent graphical elements having a different shape, character, and color range than have been generated and populated within the video stream in previous iterations of the second set of graphical elements. As shown in FIGS. 6 and 6A, the second set of graphical elements 420 or 520 may further be modified into the second set of graphical elements 620 with differing shapes, patterns, or fill characteristics. As shown in FIG. 6, the second set of graphical elements 620 are shown as hollow circles. FIG. 6A shows the second set of graphical elements 620 as differing in not only shape and size, but color as well. The second set of graphical elements 620 are a pink color not included in previously generated iterations of the second set of graphical elements shown in FIGS. 4A and 5A. Although described with respect to certain colors (e.g., blue, purple, magenta, pink, yellow, etc.) it should be understood that changes in color may include changes to any suitable color in modifying the graphical elements depicted and described.
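
The persistence behavior described above, in which a persistent element remains in every frame while a temporary element is displayed only for a predetermined period of time, can be sketched as follows; the field names and the per-frame filtering approach are assumptions for illustration.

    def visible_elements(elements: list, frame_time: float) -> list:
        """Illustrative filter: 'elements' is a list of dicts with a 'persistent'
        flag, a 'created_at' timestamp (seconds) and, for temporary elements, a
        'lifetime' in seconds. Returns the elements to draw in the current frame."""
        drawn = []
        for element in elements:
            if element["persistent"]:
                drawn.append(element)                       # always displayed
            elif frame_time - element["created_at"] <= element["lifetime"]:
                drawn.append(element)                       # still within its display period
        return drawn

    aurora = {"persistent": True, "created_at": 0.0}
    particle = {"persistent": False, "created_at": 0.0, "lifetime": 2.0}
    print(len(visible_elements([aurora, particle], frame_time=1.0)))  # 2
    print(len(visible_elements([aurora, particle], frame_time=3.0)))  # 1 (the particle has expired)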

In operation 350, the analysis component 230 detects a change in the audio data within the video stream. In some embodiments, the analysis component 230 analyzes the audio data to determine changes in at least one of the one or more audio characteristics of the audio data. For example, the analysis component 230 may identify a change in volume detected in the audio data, such as a transition within a song during a concert or a transition between songs at a concert. The analysis component 230 may also detect changes in tempo, instrument composition of a musical piece, changes in time signature, changes in beat, changes in timbre, changes in frequency, or any other changes within the audio data.
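
A minimal, non-limiting sketch of such change detection is shown below, comparing the characteristics of successive audio windows against hypothetical thresholds; the threshold values and the two features compared are assumptions for the example.

    def audio_changed(previous: dict, current: dict,
                      volume_delta: float = 0.1,
                      frequency_delta_hz: float = 50.0) -> bool:
        """Illustrative change detector: reports a change when either the volume
        or the dominant frequency moves by more than a threshold."""
        return (abs(current["volume"] - previous["volume"]) > volume_delta
                or abs(current["dominant_frequency"] - previous["dominant_frequency"]) > frequency_delta_hz)

    before = {"volume": 0.2, "dominant_frequency": 440.0}
    after = {"volume": 0.6, "dominant_frequency": 440.0}
    print(audio_changed(before, after))   # True: the volume rose by more than 0.1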

In operation 360, the modification component 260 modifies the one or more graphical interface elements in a second portion of the video stream. In some embodiments, the modification component 260 modifies the one or more graphical interface elements in response to the change in the audio data. In these instances, one or more aspects of the one or more graphical elements are modified within the second portion of the video stream while the graphical element is present within the video stream. In some embodiments, the one or more aspects or visual characteristics modified by the augmented reality system 160 comprise at least one of a color value, a brightness value, an intensity value, an opacity value, a position within at least one frame of the video stream, a size, shape, an oscillation, a pulse, velocity of the graphical element, or relative positions of two or more graphical elements, as shown in FIGS. 4-6 and 4A-6A. Although specific aspects have been described or referenced with respect to specific embodiments, it should be understood that the augmented reality system 160 may modify any suitable aspect of a graphical element present within the video stream. In some instances, modification of the one or more graphical elements may include ending presentation (e.g., inclusion in a visible portion of the video stream) of a graphical element or generation of an additional graphical element.
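
The modification step might, purely as a non-limiting illustration, adjust a few visual characteristics of each element in proportion to the current audio characteristics; the specific adjustments below (opacity, size, and a slow upward drift) are example choices only.

    def modify_elements(elements: list, characteristics: dict) -> list:
        """Illustrative modification step: scales opacity and size of each
        graphical element with the current volume and nudges its position."""
        volume = characteristics["volume"]
        for element in elements:
            element["opacity"] = min(1.0, 0.2 + volume)                    # brighter when louder
            element["size"] = element.get("size", 0.02) * (1.0 + volume)   # grows with volume
            element["y"] = (element.get("y", 0.5) + 0.01 * volume) % 1.0   # slow drift
        return elements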

In some embodiments, the user may end the real time augmented reality environment by selecting a graphical interface element. For example, the graphical interface element may be an “X” or other termination indicator positioned at a fixed point within the display device of the client device 110 and depicted as imposed over at least a portion of the video stream.

FIG. 7 depicts a flow diagram illustrating an example method 700 for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments. The operations of method 700 may be performed by components of the augmented reality system 160. In some instances, certain operations of the method 700 may be performed using one or more operations of the method 300 or as sub-operations of one or more operations of the method 300, as will be explained below in more detail. For example, as shown in FIG. 7, the method 700 may be performed as part of or in conjunction with operation 320 of the method 300.

In operation 710, the orientation component 250 detects a change in orientation of the client device 110. In some instances, the coordinate component 220 detects the change in orientation using one or more of a gyroscope, a gyroscopic sensor, an accelerometer, an altitude sensor, a proximity sensor, a near field communication sensor, a rotation sensor, a global positioning system (GPS) sensor, an audio capture component (e.g., a microphone), an image capture component (e.g., a camera or video camera), or any other suitable sensor. For example, a gyroscopic sensor within the client device 110 may detect a change in relative position of the client device 110 moving upward toward the sky. Although described with reference to an upward or skyward change in position, it should be understood that the change in relative position may be a lateral or sideways motion, a rotational position change, or any other suitable motion.

In operation 720, the coordinate component 220 determines the change in orientation exceeds a predetermined orientation threshold. In some embodiments, the change in orientation includes a vertical component. In some instances, where the change in orientation includes a vertical component, the predetermined orientation threshold may be a vertical orientation threshold. The predetermined orientation threshold may be a distance (e.g., a vertical, horizontal, or diagonal distance), an angular position (e.g., vertical, horizontal, or diagonal), or any other measurement of a change in position of the client device 110.
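
One non-limiting way to realize operations 710 and 720 is sketched below: gyroscope rotation rates are integrated over time and the accumulated change in orientation is compared against a predetermined threshold. The axis choice, units, and threshold value are assumptions for this example.

    def orientation_change_exceeds_threshold(rotation_rates_dps: list,
                                             sample_interval_s: float,
                                             threshold_degrees: float = 45.0) -> bool:
        """Integrate gyroscope rotation rates (degrees per second) about one axis
        and compare the total change in orientation against a threshold."""
        total_change = sum(rate * sample_interval_s for rate in rotation_rates_dps)
        return abs(total_change) >= threshold_degrees

    # Example: 0.5 seconds of samples at 100 Hz while the device sweeps upward at about 100 degrees/second.
    samples = [100.0] * 50
    print(orientation_change_exceeds_threshold(samples, 0.01))   # True (about 50 degrees, above the 45 degree threshold)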

In operation 730, the coordinate component 220 generates a coordinate mesh within at least a portion of the one or more frames of the video stream. The coordinate mesh may be a set of connected polygons joined at vertices corresponding to individual coordinates. In some embodiments, the coordinate mesh is a grid pattern distributed across at least a portion of the field of view of the image capture component of the client device 110. In some instances, the set of coordinates may be displayed within one or more frames of the video stream, at least temporarily, after identification of the set of coordinates. The coordinate mesh may also be displayed within one or more frames of the video stream. For example, in some instances, once the coordinate mesh is generated, the coordinate mesh may be displayed, at least temporarily, as an overlay within the one or more frames of the video stream.
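
A minimal sketch of coordinate mesh generation is given below: a grid of normalized vertex coordinates spanning part of the field of view, with quadrilateral faces joining neighboring vertices. The use of normalized coordinates and quadrilateral faces is an assumption made for the example.

    def generate_coordinate_mesh(columns: int, rows: int):
        """Build a grid-pattern coordinate mesh: vertices are normalized (x, y)
        coordinates and each face joins four neighboring vertices."""
        vertices = [(c / columns, r / rows)
                    for r in range(rows + 1) for c in range(columns + 1)]
        faces = []
        stride = columns + 1
        for r in range(rows):
            for c in range(columns):
                i = r * stride + c
                faces.append((i, i + 1, i + stride + 1, i + stride))   # one quad per grid cell
        return vertices, faces

    vertices, faces = generate_coordinate_mesh(4, 3)
    print(len(vertices), len(faces))   # 20 vertices, 12 quads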

FIG. 8 depicts a flow diagram illustrating an example method 800 for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments. The operations of method 800 may be performed by components of the augmented reality system 160. In some instances, certain operations of the method 800 may be performed using one or more operations of one or more of the methods 300 or 700 or as sub-operations of one or more operations of one or more of the methods 300 or 700, as will be explained below in more detail. For example, as shown in FIG. 8, the method 800 may be performed as part of or in conjunction with operation 340 of the method 300.

In operation 810, the generation component 240 generates a first graphical interface element associated with the one or more audio characteristics. In some embodiments, the first graphical interface element is generated with a set of first behaviors associated with at least one audio characteristic of the one or more audio characteristics. The first graphical interface element may be generated in a manner similar to that described above.

In operation 820, the generation component 240 generates a set of second graphical interface elements associated with the one or more audio characteristics. In some embodiments, the set of second graphical interface elements are generated with a set of second behaviors associated with at least one audio characteristic of the one or more audio characteristics. In some instances, the set of second graphical interface elements may be generated in a manner similar to or the same as described above.

In some embodiments, the behaviors of the set of first behaviors and the set of second behaviors differ from one another. In some embodiments, where the first graphical element is an aurora, the set of first behaviors comprise temporary changes in the first graphical element which oscillate between a first state and a second state, as shown in FIGS. 4-6. In some instances, the set of first behaviors comprise a timing cycle, a path, a position cycle, a size cycle, an opacity cycle, or any other set of actions, changes, or operations coordinating a change between a first state and a second state. In some instances, the set of second graphical elements may be a set of particles, having variable positions within the frames of the video stream, as shown in FIGS. 4-6.

In some embodiments, as part of or in conjunction with operation 360 of method 300, in operation 830, the modification component 260 modifies one or more visual characteristics of the graphical interface element according to the set of first behaviors. In some instances, the modification of the one or more visual characteristics is performed based on the change in the audio data corresponding to the set of first behaviors. For example, a change in volume or frequency of the audio data may cause the components of the augmented reality system 160 to modify the first graphical element (e.g., the aurora) to change an opacity value, color value, intensity value, or position value between the first state and the second state. As shown in FIG. 9, a set of graphical elements 900 including a first graphical element 910 and a set of second graphical elements 920 may be presented within the frames of the video stream based on a current frequency, volume, or other audio characteristic in a first state. As shown in FIG. 10, the first graphical element 910 and the set of second graphical elements 920 may be modified based on changes in the audio characteristic (e.g., frequency or volume). Where the visual aspect is a regional value, regions shown by dotted lines in FIG. 9 may change to the regional divisions shown in FIG. 10. Where the visual aspect is a color value, the modification of the graphical element 910 in FIG. 9 may be a change from blue (e.g., the first state), as shown in FIG. 15, at a first volume level (e.g., a decibel level) to red (e.g., the second state) at a second volume level, as shown in FIG. 16. In this example, the change between the first state and the second state may be a binary change with no intermediate color values or a gradual change cycling through one or more intermediate color values between blue and red.
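
The gradual change between the first state and the second state may be illustrated, in a non-limiting way, by per-channel linear interpolation between two colors as the volume moves between two levels; the decibel endpoints and RGB values below are assumptions for the example.

    def interpolate_color(volume_db: float,
                          low_db: float = 40.0, high_db: float = 90.0,
                          first_state=(0, 0, 255), second_state=(255, 0, 0)):
        """Blend from the first state (blue) to the second state (red) as the
        volume rises from low_db to high_db, clamping outside that range."""
        t = (volume_db - low_db) / (high_db - low_db)
        t = max(0.0, min(1.0, t))                     # clamp to [0, 1]
        return tuple(round(a + (b - a) * t) for a, b in zip(first_state, second_state))

    print(interpolate_color(40.0))   # (0, 0, 255): blue at the first volume level
    print(interpolate_color(65.0))   # an intermediate color value
    print(interpolate_color(90.0))   # (255, 0, 0): red at the second volume level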

The set of second behaviors may be similar to the set of first behaviors, such that a change in the audio data causes a modification in an action, visual aspect, or other aspect of at least one of the set of second graphical elements. In some embodiments, the one or more visual characteristics of the first graphical interface element are selected from a group consisting of a color value, an opacity value, a brightness value, and a position for the first graphical interface element.

In operation 840, the modification component 260 modifies one or more visual characteristics of one or more second graphical interface elements of the set of second graphical interface elements. In some embodiments, modification of the visual characteristics of the one or more second graphical interface elements is performed according to the set of second behaviors and based on the change in the audio data. For example, where the set of second graphical elements are particles distributed within the aurora, the modification of the one or more second graphical elements may include moving the one or more graphical elements along a path determined by the set of second behaviors. The set of second behaviors may include a path description, a speed of movement, or any other suitable aspect relating to the position and movement of the one or more second graphical elements.
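
A non-limiting sketch of such a second behavior is shown below, in which a particle is advanced along a sinusoidal path at a speed given by the behavior; the path description and field names are assumptions made for the example.

    import math

    def step_particle(particle: dict, behavior: dict, dt: float) -> dict:
        """Advance a particle along a sinusoidal path described by the behavior:
        the particle drifts horizontally at 'speed' and oscillates vertically."""
        particle["phase"] = particle.get("phase", 0.0) + behavior["speed"] * dt
        particle["x"] = (particle.get("x", 0.0) + behavior["speed"] * dt) % 1.0
        particle["y"] = behavior["base_y"] + behavior["amplitude"] * math.sin(particle["phase"])
        return particle

    behavior = {"speed": 0.2, "base_y": 0.3, "amplitude": 0.05}
    particle = {"x": 0.1, "y": 0.3}
    for _ in range(3):
        particle = step_particle(particle, behavior, dt=1.0 / 30.0)   # advance one 30 fps frame at a time
    print(particle)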

In operation 850, the presentation component 280 causes presentation of the modified first graphical interface element and the modified one or more second graphical interface elements within the second portion of the video stream.

FIG. 11 depicts a flow diagram illustrating an example method 1100 for generating responsive augmented reality elements within a graphical user interface, according to some example embodiments. The operations of method 1100 may be performed by components of the augmented reality system 160. In some instances, certain operations of the method 1100 may be performed using one or more operations of one or more of the methods 300, 700, or 800 or as sub-operations of one or more operations of one or more of the methods 300, 700, or 800, as will be explained below in more detail.

In operation 1110, the tracking component 270 tracks one or more positions of the one or more graphical interface elements across the one or more frames of the video stream. In some embodiments, operation 1110 is performed using one or more sub-operations, as described in more detail below.

In operation 1120, the tracking component 270 determines one or more positions for the one or more graphical interface elements. The positions may be determined with respect to the set of coordinates determined for an area, such as coordinates determined in a first frame of the video stream.

In operation 1130, the tracking component 270 identifies a change in angular position of the client device between the first frame and a subsequent frame. For example, where the client device 110 is rotated to a different horizontal angular position, the components of the augmented reality system 160 may track movement of the set of coordinates to generate the graphical elements at the same coordinates as the graphical elements occupied in one or more previous frames.

In operation 1140, the generation component 240 generates one or more second positions for the one or more graphical interface elements with respect to the set of coordinates and based on the change in angular position of the client device. The generation component 240 may also generate the graphical elements in positions in the sky which remain relatively or largely constant relative to the world when the client device 110 turns. Where the client device 110 is moved to a position where coordinates within the video stream are not included in those associated with graphical elements, the augmented reality system 160 may cease presentation of one or more of the graphical elements. For example, the aurora may no longer be presented, but one or more particles may on occasion appear within the video stream.
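
Operation 1140 may be illustrated, in a non-limiting way, by shifting each element's normalized horizontal position opposite to the device's change in yaw so that the element appears fixed relative to the world, and dropping elements that leave the field of view; the field-of-view value and the normalized coordinates are assumptions for the example.

    def second_positions(first_positions: list,
                         yaw_change_degrees: float,
                         horizontal_fov_degrees: float = 60.0) -> list:
        """Shift normalized (x, y) positions opposite to the device rotation so
        elements stay world-fixed; positions pushed outside the frame are dropped,
        which corresponds to ceasing presentation of those elements."""
        shift = yaw_change_degrees / horizontal_fov_degrees    # fraction of the frame width
        updated = []
        for x, y in first_positions:
            new_x = x - shift
            if 0.0 <= new_x <= 1.0:                            # still within the field of view
                updated.append((new_x, y))
        return updated

    positions = [(0.2, 0.1), (0.9, 0.2)]
    # Roughly [(0.65, 0.2)]: the first element is shifted out of the frame and dropped.
    print(second_positions(positions, yaw_change_degrees=15.0))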

Modules, Components, and Logic

Certain embodiments are described herein as including logic or a number of components, modules, or mechanisms. Components can constitute hardware components. A “hardware component” is a tangible unit capable of performing certain operations and can be configured or arranged in a certain physical manner. In various example embodiments, computer systems (e.g., a standalone computer system, a client computer system, or a server computer system) or hardware components of a computer system (e.g., at least one hardware processor, a processor, or a group of processors) is configured by software (e.g., an application or application portion) as a hardware component that operates to perform certain operations as described herein.

In some embodiments, a hardware component is implemented mechanically, electronically, or any suitable combination thereof. For example, a hardware component can include dedicated circuitry or logic that is permanently configured to perform certain operations. For example, a hardware component can be a special-purpose processor, such as a Field-Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC). A hardware component may also include programmable logic or circuitry that is temporarily configured by software to perform certain operations. For example, a hardware component can include software encompassed within a general-purpose processor or other programmable processor. It will be appreciated that the decision to implement a hardware component mechanically, in dedicated and permanently configured circuitry, or in temporarily configured circuitry (e.g., configured by software) can be driven by cost and time considerations.

Accordingly, the phrase “hardware component” should be understood to encompass a tangible entity, be that an entity that is physically constructed, permanently configured (e.g., hardwired), or temporarily configured (e.g., programmed) to operate in a certain manner or to perform certain operations described herein. As used herein, “hardware-implemented component” refers to a hardware component. Considering embodiments in which hardware components are temporarily configured (e.g., programmed), each of the hardware components need not be configured or instantiated at any one instance in time. For example, where a hardware component comprises a general-purpose processor configured by software to become a special-purpose processor, the general-purpose processor may be configured as respectively different special-purpose processors (e.g., comprising different hardware components) at different times. Software can accordingly configure a particular processor or processors, for example, to constitute a particular hardware component at one instance of time and to constitute a different hardware component at a different instance of time.

Hardware components can provide information to, and receive information from, other hardware components. Accordingly, the described hardware components can be regarded as being communicatively coupled. Where multiple hardware components exist contemporaneously, communications can be achieved through signal transmission (e.g., over appropriate circuits and buses) between or among two or more of the hardware components. In embodiments in which multiple hardware components are configured or instantiated at different times, communications between such hardware components may be achieved, for example, through the storage and retrieval of information in memory structures to which the multiple hardware components have access. For example, one hardware component performs an operation and stores the output of that operation in a memory device to which it is communicatively coupled. A further hardware component can then, at a later time, access the memory device to retrieve and process the stored output. Hardware components can also initiate communications with input or output devices, and can operate on a resource (e.g., a collection of information).

The various operations of example methods described herein can be performed, at least partially, by processors that are temporarily configured (e.g., by software) or permanently configured to perform the relevant operations. Whether temporarily or permanently configured, such processors constitute processor-implemented components that operate to perform operations or functions described herein. As used herein, “processor-implemented component” refers to a hardware component implemented using processors.

Similarly, the methods described herein can be at least partially processor-implemented, with a particular processor or processors being an example of hardware. For example, at least some of the operations of a method can be performed by processors or processor-implemented components. Moreover, the processors may also operate to support performance of the relevant operations in a “cloud computing” environment or as a “software as a service” (SaaS). For example, at least some of the operations may be performed by a group of computers (as examples of machines including processors), with these operations being accessible via a network (e.g., the Internet) and via appropriate interfaces (e.g., an Application Program Interface (API)).

The performance of certain of the operations may be distributed among the processors, not only residing within a single machine, but deployed across a number of machines. In some example embodiments, the processors or processor-implemented components are located in a single geographic location (e.g., within a home environment, an office environment, or a server farm). In other example embodiments, the processors or processor-implemented components are distributed across a number of geographic locations.

Applications

FIG. 12 illustrates an example mobile device 1200 executing a mobile operating system (e.g., IOS™, ANDROID™, WINDOWS® Phone, or other mobile operating systems), consistent with some embodiments. In one embodiment, the mobile device 1200 includes a touch screen operable to receive tactile data from a user 1202. For instance, the user 1202 may physically touch 1204 the mobile device 1200, and in response to the touch 1204, the mobile device 1200 may determine tactile data such as touch location, touch force, or gesture motion. In various example embodiments, the mobile device 1200 displays a home screen 1206 (e.g., Springboard on IOS™) operable to launch applications or otherwise manage various aspects of the mobile device 1200. In some example embodiments, the home screen 1206 provides status information such as battery life, connectivity, or other hardware statuses. The user 1202 can activate user interface elements by touching an area occupied by a respective user interface element. In this manner, the user 1202 interacts with the applications of the mobile device 1200. For example, touching the area occupied by a particular icon included in the home screen 1206 causes launching of an application corresponding to the particular icon.

The mobile device 1200, as shown in FIG. 12, includes an imaging device 1208. The imaging device may be a camera or any other device coupled to the mobile device 1200 capable of capturing a video stream or one or more successive images. The imaging device 1208 may be triggered by the augmented reality system 160 or a selectable user interface element to initiate capture of a video stream or succession of frames and pass the video stream or succession of images to the augmented reality system 160 for processing according to the one or more methods described in the present disclosure.

Many varieties of applications (also referred to as “apps”) can be executing on the mobile device 1200, such as native applications (e.g., applications programmed in Objective-C, Swift, or another suitable language running on IOS™, or applications programmed in Java running on ANDROID™), mobile web applications (e.g., applications written in Hypertext Markup Language-5 (HTML5)), or hybrid applications (e.g., a native shell application that launches an HTML5 session). For example, the mobile device 1200 includes a messaging app, an audio recording app, a camera app, a book reader app, a media app, a fitness app, a file management app, a location app, a browser app, a settings app, a contacts app, a telephone call app, or other apps (e.g., gaming apps, social networking apps, biometric monitoring apps). In another example, the mobile device 1200 includes a social messaging app 1210 such as SNAPCHAT® that, consistent with some embodiments, allows users to exchange ephemeral messages that include media content. In this example, the social messaging app 1210 can incorporate aspects of embodiments described herein. For example, in some embodiments the social messaging application includes an ephemeral gallery of media created by users of the social messaging application. These galleries may consist of videos or pictures posted by a user and made viewable by contacts (e.g., “friends”) of the user. Alternatively, public galleries may be created by administrators of the social messaging application consisting of media from any users of the application (and accessible by all users). In yet another embodiment, the social messaging application may include a “magazine” feature which consists of articles and other content generated by publishers on the social messaging application's platform and accessible by any users. Any of these environments or platforms may be used to implement concepts of the present invention.

In some embodiments, an ephemeral message system may include messages having ephemeral video clips or images which are deleted following a deletion trigger event such as a viewing time or viewing completion. In such embodiments, a device implementing the augmented reality system 160 may identify, track, extract, and generate representations of a face within the ephemeral video clip, as the ephemeral video clip is being captured by the device, and transmit the ephemeral video clip to another device using the ephemeral message system.
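
The following sketch illustrates one possible way to model the deletion trigger described above, under the assumption that either an elapsed viewing time or a viewing-completion event causes deletion. The class and field names are hypothetical.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical sketch of a deletion trigger for an ephemeral video clip.
public class EphemeralMessage {
    private final String clipId;
    private final Duration maxViewingTime;
    private Instant viewingStartedAt;
    private boolean viewingCompleted;

    public EphemeralMessage(String clipId, Duration maxViewingTime) {
        this.clipId = clipId;
        this.maxViewingTime = maxViewingTime;
    }

    public void onViewingStarted() {
        viewingStartedAt = Instant.now();
    }

    public void onViewingCompleted() {
        viewingCompleted = true;
    }

    // True once either trigger (viewing time elapsed or viewing completion) fires.
    public boolean shouldDelete() {
        boolean timeElapsed = viewingStartedAt != null
                && Duration.between(viewingStartedAt, Instant.now()).compareTo(maxViewingTime) >= 0;
        return viewingCompleted || timeElapsed;
    }

    public String clipId() {
        return clipId;
    }
}
```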

Software Architecture

FIG. 13 is a block diagram 1300 illustrating an architecture of software 1302, which can be installed on the devices described above. FIG. 13 is merely a non-limiting example of a software architecture, and it will be appreciated that many other architectures can be implemented to facilitate the functionality described herein. In various embodiments, the software 1302 is implemented by hardware such as a machine 1400 of FIG. 14 that includes processors 1410, memory 1430, and I/O components 1450. In this example architecture, the software 1302 can be conceptualized as a stack of layers where each layer may provide a particular functionality. For example, the software 1302 includes layers such as an operating system 1304, libraries 1306, frameworks 1308, and applications 1310. Operationally, the applications 1310 invoke application programming interface (API) calls 1312 through the software stack and receive messages 1314 in response to the API calls 1312, consistent with some embodiments.

In various implementations, the operating system 1304 manages hardware resources and provides common services. The operating system 1304 includes, for example, a kernel 1320, services 1322, and drivers 1324. The kernel 1320 acts as an abstraction layer between the hardware and the other software layers, consistent with some embodiments. For example, the kernel 1320 provides memory management, processor management (e.g., scheduling), component management, networking, and security settings, among other functionality. The services 1322 can provide other common services for the other software layers. The drivers 1324 are responsible for controlling or interfacing with the underlying hardware, according to some embodiments. For instance, the drivers 1324 can include display drivers, camera drivers, BLUETOOTH® drivers, flash memory drivers, serial communication drivers (e.g., Universal Serial Bus (USB) drivers), WI-FI® drivers, audio drivers, power management drivers, and so forth.

In some embodiments, the libraries 1306 provide a low-level common infrastructure utilized by the applications 1310. The libraries 1306 can include system libraries 1330 (e.g., C standard library) that can provide functions such as memory allocation functions, string manipulation functions, mathematical functions, and the like. In addition, the libraries 1306 can include API libraries 1332 such as media libraries (e.g., libraries to support presentation and manipulation of various media formats such as Moving Picture Experts Group-4 (MPEG4), Advanced Video Coding (H.264 or AVC), Moving Picture Experts Group Layer-3 (MP3), Advanced Audio Coding (AAC), Adaptive Multi-Rate (AMR) audio codec, Joint Photographic Experts Group (JPEG or JPG), or Portable Network Graphics (PNG)), graphics libraries (e.g., an OpenGL framework used to render two-dimensional (2D) and three-dimensional (3D) graphic content on a display), database libraries (e.g., SQLite to provide various relational database functions), web libraries (e.g., WebKit to provide web browsing functionality), and the like. The libraries 1306 can also include a wide variety of other libraries 1334 to provide many other APIs to the applications 1310.

The frameworks 1308 provide a high-level common infrastructure that can be utilized by the applications 1310, according to some embodiments. For example, the frameworks 1308 provide various graphic user interface (GUI) functions, high-level resource management, high-level location services, and so forth. The frameworks 1308 can provide a broad spectrum of other APIs that can be utilized by the applications 1310, some of which may be specific to a particular operating system or platform.

In an example embodiment, the applications 1310 include a home application 1350, a contacts application 1352, a browser application 1354, a book reader application 1356, a location application 1358, a media application 1360, a messaging application 1362, a game application 1364, and a broad assortment of other applications such as a third party application 1366. According to some embodiments, the applications 1310 are programs that execute functions defined in the programs. Various programming languages can be employed to create the applications 1310, structured in a variety of manners, such as object-oriented programming languages (e.g., Objective-C, Java, or C++) or procedural programming languages (e.g., C or assembly language). In a specific example, the third party application 1366 (e.g., an application developed using the ANDROID™ or IOS™ software development kit (SDK) by an entity other than the vendor of the particular platform) may be mobile software running on a mobile operating system such as IOS™, ANDROID™, WINDOWS® PHONE, or another mobile operating system. In this example, the third party application 1366 can invoke the API calls 1312 provided by the operating system 1304 to facilitate functionality described herein.

Example Machine Architecture and Machine-Readable Medium

FIG. 14 is a block diagram illustrating components of a machine 1400, according to some embodiments, able to read instructions (e.g., processor executable instructions) from a machine-readable medium (e.g., a non-transitory processor-readable storage medium or processor-readable storage device) and perform any of the methodologies discussed herein. Specifically, FIG. 14 shows a diagrammatic representation of the machine 1400 in the example form of a computer system, within which instructions 1416 (e.g., software, a program, an application, an applet, an app, or other executable code) for causing the machine 1400 to perform any of the methodologies discussed herein can be executed. In alternative embodiments, the machine 1400 operates as a standalone device or can be coupled (e.g., networked) to other machines. In a networked deployment, the machine 1400 may operate in the capacity of a server machine or a client machine in a server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment. The machine 1400 can comprise, but not be limited to, a server computer, a client computer, a personal computer (PC), a tablet computer, a laptop computer, a netbook, a set-top box (STB), a personal digital assistant (PDA), an entertainment media system, a cellular telephone, a smart phone, a mobile device, a wearable device (e.g., a smart watch), a smart home device (e.g., a smart appliance), other smart devices, a web appliance, a network router, a network switch, a network bridge, or any machine capable of executing the instructions 1416, sequentially or otherwise, that specify actions to be taken by the machine 1400. Further, while only a single machine 1400 is illustrated, the term “machine” shall also be taken to include a collection of machines 1400 that individually or jointly execute the instructions 1416 to perform any of the methodologies discussed herein.

In various embodiments, the machine 1400 comprises processors 1410, memory 1430, and I/O components 1450, which can be configured to communicate with each other via a bus 1402. In an example embodiment, the processors 1410 (e.g., a Central Processing Unit (CPU), a Reduced Instruction Set Computing (RISC) processor, a Complex Instruction Set Computing (CISC) processor, a Graphics Processing Unit (GPU), a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Radio-Frequency Integrated Circuit (RFIC), another processor, or any suitable combination thereof) includes, for example, a processor 1412 and a processor 1414 that may execute the instructions 1416. The term “processor” is intended to include multi-core processors that may comprise two or more independent processors (also referred to as “cores”) that can execute instructions contemporaneously. Although FIG. 14 shows multiple processors, the machine 1400 may include a single processor with a single core, a single processor with multiple cores (e.g., a multi-core processor), multiple processors with a single core, multiple processors with multiple cores, or any combination thereof.
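
As a small illustration of contemporaneous execution on multiple cores, the sketch below uses the standard Java ExecutorService to run independent tasks on a thread pool sized to the available processors. The placeholder work (summing a counter) stands in for any of the operations described herein.

```java
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class ContemporaneousExecution {
    public static void main(String[] args) throws Exception {
        int cores = Runtime.getRuntime().availableProcessors();
        ExecutorService pool = Executors.newFixedThreadPool(cores);

        // One independent task per core; each task may run contemporaneously on its own core.
        List<Callable<Long>> tasks = IntStream.range(0, cores)
                .<Callable<Long>>mapToObj(i -> () -> {
                    long sum = 0;
                    for (long n = 0; n < 10_000_000L; n++) {
                        sum += n;
                    }
                    return sum;
                })
                .collect(Collectors.toList());

        // invokeAll blocks until every task has finished and returns their futures.
        for (Future<Long> result : pool.invokeAll(tasks)) {
            System.out.println("partial result: " + result.get());
        }
        pool.shutdown();
    }
}
```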

The memory 1430 comprises a main memory 1432, a static memory 1434, and a storage unit 1436 accessible to the processors 1410 via the bus 1402, according to some embodiments. The storage unit 1436 can include a machine-readable medium 1438 on which are stored the instructions 1416 embodying any of the methodologies or functions described herein. The instructions 1416 can also reside, completely or at least partially, within the main memory 1432, within the static memory 1434, within at least one of the processors 1410 (e.g., within the processor's cache memory), or any suitable combination thereof, during execution thereof by the machine 1400. Accordingly, in various embodiments, the main memory 1432, the static memory 1434, and the processors 1410 are considered machine-readable media 1438.

As used herein, the term “memory” refers to a machine-readable medium 1438 able to store data temporarily or permanently and may be taken to include, but not be limited to, random-access memory (RAM), read-only memory (ROM), buffer memory, flash memory, and cache memory. While the machine-readable medium 1438 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) able to store the instructions 1416. The term “machine-readable medium” shall also be taken to include any medium, or combination of multiple media, that is capable of storing instructions (e.g., instructions 1416) for execution by a machine (e.g., machine 1400), such that the instructions, when executed by processors of the machine 1400 (e.g., processors 1410), cause the machine 1400 to perform any of the methodologies described herein. Accordingly, a “machine-readable medium” refers to a single storage apparatus or device, as well as “cloud-based” storage systems or storage networks that include multiple storage apparatus or devices. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, data repositories in the form of a solid-state memory (e.g., flash memory), an optical medium, a magnetic medium, other non-volatile memory (e.g., Erasable Programmable Read-Only Memory (EPROM)), or any suitable combination thereof. The term “machine-readable medium” specifically excludes non-statutory signals per se.

The I/O components 1450 include a wide variety of components to receive input, provide output, produce output, transmit information, exchange information, capture measurements, and so on. In general, it will be appreciated that the I/O components 1450 can include many other components that are not shown in FIG. 14. The I/O components 1450 are grouped according to functionality merely for simplifying the following discussion, and the grouping is in no way limiting. In various example embodiments, the I/O components 1450 include output components 1452 and input components 1454. The output components 1452 include visual components (e.g., a display such as a plasma display panel (PDP), a light emitting diode (LED) display, a liquid crystal display (LCD), a projector, or a cathode ray tube (CRT)), acoustic components (e.g., speakers), haptic components (e.g., a vibratory motor), other signal generators, and so forth. The input components 1454 include alphanumeric input components (e.g., a keyboard, a touch screen configured to receive alphanumeric input, a photo-optical keyboard, or other alphanumeric input components), point based input components (e.g., a mouse, a touchpad, a trackball, a joystick, a motion sensor, or other pointing instruments), tactile input components (e.g., a physical button, a touch screen that provides location and force of touches or touch gestures, or other tactile input components), audio input components (e.g., a microphone), and the like.

In some further example embodiments, the I/O components 1450 include biometric components 1456, motion components 1458, environmental components 1460, or position components 1462, among a wide array of other components. For example, the biometric components 1456 include components to detect expressions (e.g., hand expressions, facial expressions, vocal expressions, body gestures, or mouth gestures), measure biosignals (e.g., blood pressure, heart rate, body temperature, perspiration, or brain waves), identify a person (e.g., voice identification, retinal identification, facial identification, fingerprint identification, or electroencephalogram based identification), and the like. The motion components 1458 include acceleration sensor components (e.g., accelerometer), gravitation sensor components, rotation sensor components (e.g., gyroscope), and so forth. The environmental components 1460 include, for example, illumination sensor components (e.g., photometer), temperature sensor components (e.g., thermometers that detect ambient temperature), humidity sensor components, pressure sensor components (e.g., barometer), acoustic sensor components (e.g., microphones that detect background noise), proximity sensor components (e.g., infrared sensors that detect nearby objects), gas sensor components (e.g., machine olfaction detection sensors, gas detection sensors to detect concentrations of hazardous gases for safety or to measure pollutants in the atmosphere), or other components that may provide indications, measurements, or signals corresponding to a surrounding physical environment. The position components 1462 include location sensor components (e.g., a Global Positioning System (GPS) receiver component), altitude sensor components (e.g., altimeters or barometers that detect air pressure from which altitude may be derived), orientation sensor components (e.g., magnetometers), and the like.
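
As an example of reading one of the motion components described above (assuming an Android platform), the sketch below registers a listener for the accelerometer through the standard SensorManager API. How the acceleration values are consumed is left as an illustrative comment.

```java
import android.content.Context;
import android.hardware.Sensor;
import android.hardware.SensorEvent;
import android.hardware.SensorEventListener;
import android.hardware.SensorManager;

// Minimal Android sketch of reading a motion component (accelerometer).
public class AccelerationReader implements SensorEventListener {
    private final SensorManager sensorManager;
    private final Sensor accelerometer;

    public AccelerationReader(Context context) {
        sensorManager = (SensorManager) context.getSystemService(Context.SENSOR_SERVICE);
        accelerometer = sensorManager.getDefaultSensor(Sensor.TYPE_ACCELEROMETER);
    }

    public void start() {
        sensorManager.registerListener(this, accelerometer, SensorManager.SENSOR_DELAY_NORMAL);
    }

    public void stop() {
        sensorManager.unregisterListener(this);
    }

    @Override
    public void onSensorChanged(SensorEvent event) {
        float ax = event.values[0]; // acceleration along x (m/s^2), including gravity
        float ay = event.values[1];
        float az = event.values[2];
        // Pass (ax, ay, az) to whatever component consumes motion data.
    }

    @Override
    public void onAccuracyChanged(Sensor sensor, int accuracy) {
        // No-op for this sketch.
    }
}
```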

Communication can be implemented using a wide variety of technologies. The I/O components 1450 may include communication components 1464 operable to couple the machine 1400 to a network 1480 or devices 1470 via a coupling 1482 and a coupling 1472, respectively. For example, the communication components 1464 include a network interface component or another suitable device to interface with the network 1480. In further examples, communication components 1464 include wired communication components, wireless communication components, cellular communication components, Near Field Communication (NFC) components, BLUETOOTH® components (e.g., BLUETOOTH® Low Energy), WI-FI® components, and other communication components to provide communication via other modalities. The devices 1470 may be another machine or any of a wide variety of peripheral devices (e.g., a peripheral device coupled via a Universal Serial Bus (USB)).

Moreover, in some embodiments, the communication components 1464 detect identifiers or include components operable to detect identifiers. For example, the communication components 1464 include Radio Frequency Identification (RFID) tag reader components, NFC smart tag detection components, optical reader components (e.g., an optical sensor to detect one-dimensional bar codes such as a Universal Product Code (UPC) bar code, multi-dimensional bar codes such as a Quick Response (QR) code, Aztec Code, Data Matrix, Dataglyph, MaxiCode, PDF417, Ultra Code, Uniform Commercial Code Reduced Space Symbology (UCC RSS)-2D bar codes, and other optical codes), acoustic detection components (e.g., microphones to identify tagged audio signals), or any suitable combination thereof. In addition, a variety of information can be derived via the communication components 1464, such as location via Internet Protocol (IP) geo-location, location via WI-FI® signal triangulation, location via detecting a BLUETOOTH® or NFC beacon signal that may indicate a particular location, and so forth.

Transmission Medium

In various example embodiments, portions of the network 1480 can be an ad hoc network, an intranet, an extranet, a virtual private network (VPN), a local area network (LAN), a wireless LAN (WLAN), a wide area network (WAN), a wireless WAN (WWAN), a metropolitan area network (MAN), the Internet, a portion of the Internet, a portion of the Public Switched Telephone Network (PSTN), a plain old telephone service (POTS) network, a cellular telephone network, a wireless network, a WI-FI® network, another type of network, or a combination of two or more such networks. For example, the network 1480 or a portion of the network 1480 may include a wireless or cellular network, and the coupling 1482 may be a Code Division Multiple Access (CDMA) connection, a Global System for Mobile communications (GSM) connection, or another type of cellular or wireless coupling. In this example, the coupling 1482 can implement any of a variety of types of data transfer technology, such as Single Carrier Radio Transmission Technology (1×RTT), Evolution-Data Optimized (EVDO) technology, General Packet Radio Service (GPRS) technology, Enhanced Data rates for GSM Evolution (EDGE) technology, Third Generation Partnership Project (3GPP) including 3G, fourth generation wireless (4G) networks, Universal Mobile Telecommunications System (UMTS), High Speed Packet Access (HSPA), Worldwide Interoperability for Microwave Access (WiMAX), Long Term Evolution (LTE) standard, others defined by various standard-setting organizations, other long range protocols, or other data transfer technology.

In example embodiments, the instructions 1416 are transmitted or received over the network 1480 using a transmission medium via a network interface device (e.g., a network interface component included in the communication components 1464) and utilizing any one of a number of well-known transfer protocols (e.g., Hypertext Transfer Protocol (HTTP)). Similarly, in other example embodiments, the instructions 1416 are transmitted or received using a transmission medium via the coupling 1472 (e.g., a peer-to-peer coupling) to the devices 1470. The term “transmission medium” shall be taken to include any intangible medium that is capable of storing, encoding, or carrying the instructions 1416 for execution by the machine 1400, and includes digital or analog communications signals or other intangible media to facilitate communication of such software.

Furthermore, the machine-readable medium 1438 is non-transitory (in other words, not having any transitory signals) in that it does not embody a propagating signal. However, labeling the machine-readable medium 1438 “non-transitory” should not be construed to mean that the medium is incapable of movement; the medium should be considered as being transportable from one physical location to another. Additionally, since the machine-readable medium 1438 is tangible, the medium may be considered to be a machine-readable device.

Language

Throughout this specification, plural instances may implement components, operations, or structures described as a single instance. Although individual operations of methods are illustrated and described as separate operations, individual operations may be performed concurrently, and nothing requires that the operations be performed in the order illustrated. Structures and functionality presented as separate components in example configurations may be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component may be implemented as separate components. These and other variations, modifications, additions, and improvements fall within the scope of the subject matter herein.

Although an overview of the inventive subject matter has been described with reference to specific example embodiments, various modifications and changes may be made to these embodiments without departing from the broader scope of embodiments of the present disclosure. Such embodiments of the inventive subject matter may be referred to herein, individually or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single disclosure or inventive concept if more than one is, in fact, disclosed.

The embodiments illustrated herein are described in sufficient detail to enable those skilled in the art to practice the teachings disclosed. Other embodiments may be used and derived therefrom, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. The Detailed Description, therefore, is not to be taken in a limiting sense, and the scope of various embodiments is defined only by the appended claims, along with the full range of equivalents to which such claims are entitled.

As used herein, the term “or” may be construed in either an inclusive or exclusive sense. Moreover, plural instances may be provided for resources, operations, or structures described herein as a single instance. Additionally, boundaries between various resources, operations, components, engines, and data stores are somewhat arbitrary, and particular operations are illustrated in a context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within a scope of various embodiments of the present disclosure. In general, structures and functionality presented as separate resources in the example configurations may be implemented as a combined structure or resource. Similarly, structures and functionality presented as a single resource may be implemented as separate resources. These and other variations, modifications, additions, and improvements fall within a scope of embodiments of the present disclosure as represented by the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.

What is claimed is:
 1. A method, comprising: identifying one or more graphical interface elements; in response to identifying the one or more graphical interface elements, adding the one or more graphical interface elements to a first frame of a video to modify the first frame of the video to include the identified one or more graphical interface elements; generating a display of the identified one or more graphical interface elements over visual content depicted in the first frame of the video, the one or more graphical interface elements comprising a first portion and a second portion, wherein the first portion is persistently displayed, and wherein the second portion is temporarily displayed for a predetermined period of time together with the first portion; moving the second portion of the one or more graphical interface elements along a path at a specified speed of movement, the moving comprising moving the second portion of the one or more graphical interface elements that have been added to the first frame to a new position in a second frame of the video determined based on the path and the speed of movement; and at expiry of the predetermined period of time, ceasing display of the second portion while maintaining display of the first portion.
 2. The method of claim 1, further comprising: receiving audio data, wherein the one or more graphical interface elements correspond to the audio data; and detecting one or more audio characteristics within the audio data of a video stream, wherein the one or more graphical interface elements are identified based on the detected one or more audio characteristics.
 3. The method of claim 1, wherein the identified one or more graphical interface elements are generated for display while one or more audio characteristics continue to be detected within audio data associated with multiple frames of a video stream, and wherein the one or more graphical interface elements are displayed in a first portion of the video stream.
 4. The method of claim 1, further comprising: detecting a change in audio data within a video stream; and in response to detecting the change in the audio data, modifying the one or more graphical interface elements displayed in a subsequent portion of the video stream by gradually changing a visual feature of at least one of the one or more graphical interface elements from a first visual feature to a second visual feature.
 5. The method of claim 1, wherein one or more audio characteristics detected within audio data include volume or frequency, and wherein the second portion of the one or more graphical interface elements fades into view above a person depicted within a video stream that includes the audio data.
 6. The method of claim 1, wherein the one or more graphical interface elements are generated for display in a first color, further comprising modifying the one or more graphical interface elements by gradually changing a color of at least one of the one or more graphical interface elements from the first color to a second color in response to detecting a change in volume or frequency of audio data.
 7. The method of claim 1, further comprising: generating a first graphical interface element, the first graphical interface element being generated with a set of first behaviors associated with a first audio characteristic; and generating a set of second graphical interface elements, the set of second graphical interface elements being generated with a set of second behaviors associated with a second audio characteristic.
 8. The method of claim 1, further comprising: determining one or more positions for the one or more graphical interface elements with respect to a set of coordinates in the first frame of a video stream; identifying a change in angular position of a client device between the first frame and a subsequent frame; and generating one or more second positions for the one or more graphical interface elements with respect to the set of coordinates based on the change in angular position of the client device.
 9. The method of claim 1, further comprising: tracking one or more positions of the one or more graphical interface elements across one or more frames of a video stream.
 10. The method of claim 1, further comprising: detecting a change in orientation of a client device from which audio data is received; determining that a user input has been performed on the client device for a threshold period of time; and in response to detecting the change in orientation and determining that the user input has been performed for the threshold period of time, initiating generation of the display of the identified one or more graphical interface elements.
 11. The method of claim 10, wherein the user input comprises a press and hold operation on a touchscreen of the client device, and wherein the change in orientation comprises moving the client device to point towards a sky.
 12. A device comprising: one or more processors; and a computer readable storage medium comprising instructions that, when executed by the one or more processors of a device, cause the one or more processors to perform operations comprising: identifying one or more graphical interface elements; in response to identifying the one or more graphical interface elements, adding the one or more graphical interface elements to a first frame of a video to modify the first frame of the video to include the identified one or more graphical interface elements; generating a display of the identified one or more graphical interface elements over visual content depicted in the first frame of the video, the one or more graphical interface elements comprising a first portion and a second portion, wherein the first portion is persistently displayed, and wherein the second portion is temporarily displayed for a predetermined period of time together with the first portion; moving the second portion of the one or more graphical interface elements along a path at a specified speed of movement, the moving comprising moving the second portion of the one or more graphical interface elements that have been added to the first frame to a new position in a second frame of the video determined based on the path and the speed of movement; and at expiry of the predetermined period of time, ceasing display of the second portion while maintaining display of the first portion.
 13. The device of claim 12, wherein one or more audio characteristics detected within audio data include volume or frequency, and wherein the second portion of the one or more graphical interface elements fades into view above a person depicted within a video stream that includes the audio data.
 14. The device of claim 12, wherein the one or more graphical interface elements are generated for display in a first color, and wherein the operations further comprise modifying the one or more graphical interface elements by gradually changing a color of at least one of the one or more graphical interface elements from the first color to a second color in response to detecting a change in a volume or frequency in audio data.
 15. The device of claim 12, wherein the operations further comprise: generating a first graphical interface element, the first graphical interface element being generated with a set of first behaviors associated with a first audio characteristic; and generating a set of second graphical interface elements, the set of second graphical interface elements being generated with a set of second behaviors associated with a second audio characteristic.
 16. The device of claim 12, wherein the operations further comprise: determining one or more positions for the one or more graphical interface elements with respect to a set of coordinates in the first frame of a video stream; identifying a change in angular position of a client device between the first frame and a subsequent frame; and generating one or more second positions for the one or more graphical interface elements with respect to the set of coordinates based on the change in angular position of the client device.
 17. The device of claim 12, wherein the operations further comprise tracking one or more positions of the one or more graphical interface elements across one or more frames of a video stream.
 18. A non-transitory computer readable storage medium comprising non-transitory instructions that, when executed by one or more processors of a device, cause the device to perform operations comprising: identifying one or more graphical interface elements; in response to identifying the one or more graphical interface elements, adding the one or more graphical interface elements to a first frame of a video to modify the first frame of the video to include the identified one or more graphical interface elements; generating a display of the identified one or more graphical interface elements over visual content depicted in the first frame of the video, the one or more graphical interface elements comprising a first portion and a second portion, wherein the first portion is persistently displayed, and wherein the second portion is temporarily displayed for a predetermined period of time together with the first portion; moving the second portion of the one or more graphical interface elements along a path at a specified speed of movement, the moving comprising moving the second portion of the one or more graphical interface elements that have been added to the first frame to a new position in a second frame of the video determined based on the path and the speed of movement; and at expiry of the predetermined period of time, ceasing display of the second portion while maintaining display of the first portion.